Prediction of signal peptides in archaea.
نویسندگان
چکیده
Computational prediction of signal peptides (SPs) and their cleavage sites is of great importance in computational biology; however, currently there is no available method capable of predicting reliably the SPs of archaea, due to the limited amount of experimentally verified proteins with SPs. We performed an extensive literature search in order to identify archaeal proteins having experimentally verified SP and managed to find 69 such proteins, the largest number ever reported. A detailed analysis of these sequences revealed some unique features of the SPs of archaea, such as the unique amino acid composition of the hydrophobic region with a higher than expected occurrence of isoleucine, and a cleavage site resembling more the sequences of gram-positives with almost equal amounts of alanine and valine at the position-3 before the cleavage site and a dominant alanine at position-1, followed in abundance by serine and glycine. Using these proteins as a training set, we trained a hidden Markov model method that predicts the presence of the SPs and their cleavage sites and also discriminates such proteins from cytoplasmic and transmembrane ones. The method performs satisfactorily, yielding a 35-fold cross-validation procedure, a sensitivity of 100% and specificity 98.41% with the Matthews' correlation coefficient being equal to 0.964. This particular method is currently the only available method for the prediction of secretory SPs in archaea, and performs consistently and significantly better compared with other available predictors that were trained on sequences of eukaryotic or bacterial origin. Searching 48 completely sequenced archaeal genomes we identified 9437 putative SPs. The method, PRED-SIGNAL, and the results are freely available for academic users at http://bioinformatics.biol.uoa.gr/PRED-SIGNAL/ and we anticipate that it will be a valuable tool for the computational analysis of archaeal genomes.
منابع مشابه
Machine learning approaches for the prediction of signal peptides and other protein sorting signals.
Prediction of protein sorting signals from the sequence of amino acids has great importance in the field of proteomics today. Recently, the growth of protein databases, combined with machine learning approaches, such as neural networks and hidden Markov models, have made it possible to achieve a level of reliability where practical use in, for example automatic database annotation is feasible. ...
متن کاملCombined prediction of Tat and Sec signal peptides with hidden Markov models
MOTIVATION Computational prediction of signal peptides is of great importance in computational biology. In addition to the general secretory pathway (Sec), Bacteria, Archaea and chloroplasts possess another major pathway that utilizes the Twin-Arginine translocase (Tat), which recognizes longer and less hydrophobic signal peptides carrying a distinctive pattern of two consecutive Arginines (RR)...
متن کاملPeriplasmic expression of Bacillus thermocatenulatus lipase in Escherichia coli in presence of different signal sequences
Efforts to express lipase in the periplasmic space of Escherichia coli have so far been unsuccessful andmost of the expressed recombinant lipases accumulate in the insoluble cell fraction. To evaluate the role ofnative and heterologous signal peptides in translocation of the lipase across the inner membrane of E. coli,the lipase gene (btl2) was cloned downstream of the native ...
متن کاملIn silico prediction of anticancer peptides by TRAINER tool
Cancer is one of the causes of death in the world. Several treatment methods exist against cancer cells such as radiotherapy and chemotherapy. Since traditional methods have side effects on normal cells and are expensive, identification and developing a new method to cancer therapy is very important. Antimicrobial peptides, present in a wide variety of organisms, such as plants, amphibians and ...
متن کاملAntimicrobial Peptides of Innate Immune System as a Suitable Compound for Cancer Treatment and Reduction of its Related Infectious Disease
Application of chemotherapy in cancerous children leads to reduction of immune system efficiency. Therefore, these children are prone to various infectious diseases. The excessive use of antibiotics can bring about antibiotic resistant strains. Hence, it is essential to investigate new therapies for this problem. On the other hand, the emergence of resistance against multiple drugs is a major p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Protein engineering, design & selection : PEDS
دوره 22 1 شماره
صفحات -
تاریخ انتشار 2009